THE IMAGES OF NON-COMMUTATIVE POLYNOMIALS 
EVALUATED ON 2 x 2 MATRICES. 

ALEXEY KANEL-BELOV, SERGEY MALEV, LOUIS ROWEN

Abstract. Let $p$ be a multilinear polynomial in several non-commuting
variables with coefficients in a quadratically closed field $K$ of any
characteristic. It has been conjectured that for any $n$, the image of $p$
evaluated on the set $M_n(K)$ of $n$ by $n$ matrices is either zero, or the set
of scalar matrices, or the set $sl_n(K)$ of matrices of trace 0, or all of
$M_n(K)$. We prove the conjecture for $n = 2$, and show that although the
analogous assertion fails for completely homogeneous polynomials, one can
salvage the conjecture in this case by including the set of all non-nilpotent
matrices of trace zero and also permitting dense subsets of $M_n(K)$.

1. Introduction

Images of polynomials evaluated on algebras play an important role in
noncommutative algebra. In particular, various important problems related to
the theory of polynomial identities have been settled after the construction of
central polynomials by Formanek [F1] and Razmyslov [Ra1].

The parallel topic in group theory (the images of words in groups) also has been
studied extensively, particularly in recent years. Investigation of the image
sets of words in pro-$p$-groups is related to the investigation of Lie
polynomials and helped Zelmanov [Ze] to prove that the free pro-$p$-group
cannot be embedded in the algebra of $n \times n$ matrices when $p \gg n$. (For
$p > 2$, the impossibility of embedding the free pro-$p$-group into the algebra
of $2 \times 2$ matrices had been proved by Zubkov [Zu].) The general problem of
nonlinearity of the free pro-$p$-group is related on the one hand with images
of Lie polynomials and words in groups, and on the other hand with problems of
Specht type, which are of significant current interest.

Borel [B] (also cf. [La]) proved that for any simple (semisimple) algebraic
group $G$ and any word $w$ of the free group on $r$ variables, the word map
$w : G^r \to G$ is dominant. Larsen and Shalev [LaS] showed that any element of
a simple group can be written as a product of length two in the word map, and
Shalev [S] proved Ore's conjecture, that the image of the commutator word in a
simple group is all of the group.

In this note we consider the question, reputedly raised by Kaplansky, of the
possible image set $\operatorname{Im} p$ of a polynomial $p$ on matrices. When
$p = x_1x_2 - x_2x_1$, this is a theorem of Albert and Muckenhoupt [AlM]. For
an arbitrary polynomial, the question was settled for the case when $K$ is a
finite field by Chuang [Ch], who proved that a subset $S \subseteq M_n(K)$
containing $0$ is the image of a polynomial with constant term zero if and only
if $S$ is invariant under conjugation. Later Chuang's result was generalized by
Kulyamin [Ku1], [Ku2] for graded algebras.

This work was financially supported by the Israel Science Foundation (grant no. 1178/06).
The second named author was supported by an Israeli Ministry of Immigrant
Absorption scholarship.

We are grateful to Kulyamin, Latyshev, Mihalev, E. Plotkin, and L. Small for useful comments.
Latyshev and Mihalev indicated that the problem was originally posed by Kaplansky.

Chuang [Ch] also observed that for an infinite field $K$, if
$\operatorname{Im} p$ consists only of nilpotent matrices, then $p$ is a
polynomial identity (PI). This can be seen via Amitsur's Theorem [Row, Theorem
3.26, p. 176], which says that the relatively free algebra of generic matrices
is a domain. Indeed, $p^n$ must be a PI for $M_n(K)$, implying $p$ is a PI.

Lee and Zhou proved [LeZh, Theorem 2.4] that when $K$ is an infinite division
ring, for any non-identity $p$ with coefficients in the center of $K$,
$\operatorname{Im} p$ contains an invertible matrix.

Over an infinite field, it is not difficult to ascertain the linear span of the
values of any polynomial. Indeed, standard multilinearization techniques enable
one to reduce to the case where the polynomial $p$ is multilinear, in which
case the linear span of its values comprises a Lie ideal since, as is
well known,

\[
[a, p(a_1, \dots, a_n)] = p([a, a_1], a_2, \dots, a_n) + p(a_1, [a, a_2], \dots, a_n) + \cdots + p(a_1, \dots, [a, a_n]),
\]

and Herstein [Her] characterized Lie ideals of a simple ring $R$ as either
being contained in the center or containing the commutator Lie ideal $[R, R]$.
Another proof is given in [BK]; also see Lemma 5 below. It is considerably more
difficult to determine the actual image set $\operatorname{Im} p$, rather than
its linear span.
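The commutator identity above can be checked numerically. The following is a minimal sketch, not part of the original argument: the polynomial `p`, the helper names, and the sample matrices are all hypothetical choices made for illustration, and exact integer arithmetic is used so that the comparison is not affected by rounding.

```python
# Sketch: check [a, p(x1,x2,x3)] = p([a,x1],x2,x3) + p(x1,[a,x2],x3) + p(x1,x2,[a,x3])
# for one hypothetical multilinear polynomial p on 2 x 2 integer matrices.

def mat_mul(a, b):
    """Product of two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def mat_add(a, b):
    """Entrywise sum of two matrices."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def mat_scale(c, a):
    """Scalar multiple c * a."""
    return [[c * x for x in row] for row in a]

def comm(a, b):
    """Commutator [a, b] = ab - ba."""
    return mat_add(mat_mul(a, b), mat_scale(-1, mat_mul(b, a)))

def p(x1, x2, x3):
    """Hypothetical multilinear polynomial 2*x1*x2*x3 - 3*x3*x1*x2 + x2*x1*x3."""
    t1 = mat_scale(2, mat_mul(mat_mul(x1, x2), x3))
    t2 = mat_scale(-3, mat_mul(mat_mul(x3, x1), x2))
    t3 = mat_mul(mat_mul(x2, x1), x3)
    return mat_add(mat_add(t1, t2), t3)

def derivation_sides(a, x1, x2, x3):
    """Return both sides of the identity for the given matrices."""
    lhs = comm(a, p(x1, x2, x3))
    rhs = mat_add(mat_add(p(comm(a, x1), x2, x3), p(x1, comm(a, x2), x3)),
                  p(x1, x2, comm(a, x3)))
    return lhs, rhs

# Arbitrary sample matrices (assumptions, chosen only for the check).
a = [[1, 2], [3, 4]]
x1 = [[0, 1], [5, -2]]
x2 = [[2, 0], [1, 1]]
x3 = [[1, -1], [4, 3]]
lhs, rhs = derivation_sides(a, x1, x2, x3)
print(lhs == rhs)
```

The check works because $x \mapsto [a, x]$ is a derivation, so it distributes over each monomial of a multilinear polynomial one variable at a time.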
Thus, in [Dn], Lvov formulated Kaplansky's question as follows:

Question 1. (I. Lvov) Let $p$ be a multilinear polynomial over a field $K$. Is
the set of values of $p$ on the matrix algebra $M_n(K)$ a vector space?

In view of the above discussion, Question 1 is equivalent to the following:

Conjecture 1. If $p$ is a multilinear polynomial evaluated on the matrix ring
$M_n(K)$, then $\operatorname{Im} p$ is either $\{0\}$, $K$, $sl_n(K)$, or
$M_n(K)$. Here $K$ is the set of scalar matrices and $sl_n(K)$ is the set of
matrices of trace zero.

Example 1. $\operatorname{Im} p$ can indeed equal $\{0\}$, $K$, $sl_n(K)$, or
$M_n(K)$. For example, if our polynomial is in one variable and $p(x) = x$,
then $\operatorname{Im} p = M_n(K)$. The image of the polynomial $[x_1, x_2]$
is $sl_n(K)$. If the polynomial $p$ is central, then its image is $K$, and
examples of such polynomials can be found in [Ra1] and in [F1]. Finally, if the
polynomial $p$ is a PI, then its image is $\{0\}$, and the standard polynomial
$s_{2n}$ is an example of such a polynomial.
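Two of the cases in Example 1 are easy to test on $2 \times 2$ integer matrices. The sketch below is only an illustration under stated assumptions: the helper names and the sample matrices are hypothetical, and the standard polynomial is taken in its usual form $s_k(x_1, \dots, x_k) = \sum_{\sigma \in S_k} \operatorname{sgn}(\sigma)\, x_{\sigma(1)} \cdots x_{\sigma(k)}$, so that $s_2$ is the commutator and $s_4$ vanishes on $M_2(K)$ by the Amitsur-Levitzki theorem.

```python
# Sketch: s_2 = [x1, x2] has trace zero, and s_4 vanishes on 2 x 2 matrices.
from itertools import permutations

def mat_mul(a, b):
    """Product of two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def mat_add(a, b):
    """Entrywise sum of two matrices."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def mat_scale(c, a):
    """Scalar multiple c * a."""
    return [[c * x for x in row] for row in a]

def trace(a):
    """Sum of the diagonal entries."""
    return sum(a[i][i] for i in range(len(a)))

def perm_sign(perm):
    """Sign of a permutation, computed by counting inversions."""
    sign = 1
    for i in range(len(perm)):
        for j in range(i + 1, len(perm)):
            if perm[i] > perm[j]:
                sign = -sign
    return sign

def standard_polynomial(mats):
    """Evaluate s_k on the k matrices in `mats`."""
    n = len(mats[0])
    total = [[0] * n for _ in range(n)]
    for perm in permutations(range(len(mats))):
        prod = [[int(i == j) for j in range(n)] for i in range(n)]
        for idx in perm:
            prod = mat_mul(prod, mats[idx])
        total = mat_add(total, mat_scale(perm_sign(perm), prod))
    return total

e12 = [[0, 1], [0, 0]]
e21 = [[0, 0], [1, 0]]
# Arbitrary sample matrices (assumptions, chosen only for the check).
sample = [[[1, 2], [3, 4]], [[0, 1], [5, -2]], [[2, 0], [1, 1]], [[1, -1], [4, 3]]]

print(standard_polynomial([e12, e21]))        # the commutator [e12, e21]
print(trace(standard_polynomial(sample[:2])))  # trace of a commutator
print(standard_polynomial(sample))             # s_4 on four 2 x 2 matrices
```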

As noted above, the conjecture fails for non-multilinear polynomials when $K$
is a finite field. The situation is considerably subtler for images of
non-multilinear, completely homogeneous polynomials than for multilinear
polynomials. Over any field $K$, applying the structure theory of division
rings to Amitsur's theorem, it is not difficult to get an example of a
completely homogeneous polynomial $f$, noncentral on $M_3(K)$, whose values all
have third powers central; clearly its image does not comprise a subspace of
$M_3(K)$. Furthermore, in the (non-multilinear) completely homogeneous case,
the set of values could be dense without including all matrices. (Analogously,
although the finite basis problem for multilinear identities is not yet settled
in nonzero characteristic, there are counterexamples for completely homogeneous
polynomials, cf. [B].)

Our main results in this note are for $n = 2$, for which we settle Conjecture 1,
proving the following results (see §2 for terminology). We call a field $K$
quadratically closed if every nonconstant polynomial of degree $\le 2 \deg p$
in $K[x]$ has a root in $K$.

Theorem 1. Let $p(x_1, \dots, x_m)$ be a semi-homogeneous polynomial (defined
below) evaluated on the algebra $M_2(K)$ of $2 \times 2$ matrices over a
quadratically closed field. Then $\operatorname{Im} p$ is either $\{0\}$, $K$,
the set of all non-nilpotent matrices having trace zero, $sl_2(K)$, or a dense
subset of $M_2(K)$ (with respect to the Zariski topology).

(We also give examples to show how p can have these images.) 

Theorem 2. If $p$ is a multilinear polynomial evaluated on the matrix ring
$M_2(K)$ (where $K$ is a quadratically closed field), then
$\operatorname{Im} p$ is either $\{0\}$, $K$, $sl_2(K)$, or $M_2(K)$.

Whereas for $2 \times 2$ matrices one has a full positive answer for
multilinear polynomials, the situation is ambiguous for homogeneous
polynomials, since, as we shall see, certain invariant sets cannot occur as
their images. For the general non-homogeneous case, the image of a polynomial
need not be dense, even if it is noncentral and takes on values of nonzero
trace, as we see in Example 5. In this paper, we start with the homogeneous
case (which includes the completely homogeneous case), then discuss the
nonhomogeneous case, and finally give the complete picture for the multilinear
case.

The proofs of our theorems use some algebraic-geometric tools in conjunction
with ideas from graph theory. The final part of the proof of Theorem 2 uses the
First Fundamental Theorem of Invariant Theory (that in the case Char $K = 0$,
invariant functions evaluated on matrices are polynomials involving traces),
proved by Helling [Hel], Procesi [P], and Razmyslov [Ra3]. The formulation in
positive characteristic, due to Donkin [D], is somewhat more intricate.
$GL_n(K)$ acts on $m$-tuples of $n \times n$ matrices by simultaneous
conjugation.

Theorem (Donkin [D]). For any $m, n \in \mathbb{N}$, the algebra of polynomial
invariants $K[M_n(K)^m]^{GL_n(K)}$ under $GL_n(K)$ is generated by the trace
functions

\[
T_{i,j}(x_1, x_2, \dots, x_m) = \operatorname{Trace}\bigl(x_{i_1} x_{i_2} \cdots x_{i_r}, \textstyle\bigwedge^j K^n\bigr), \tag{1}
\]

where $i = (i_1, \dots, i_r)$, all $i_l \le m$, $r \in \mathbb{N}$, $j > 0$, and
$x_{i_1} x_{i_2} \cdots x_{i_r}$ acts as a linear transformation on the exterior
algebra $\bigwedge^j K^n$.

Remark. For $n = 2$ we have a polynomial function in expressions of the form
$\operatorname{Trace}(A, \bigwedge^2 K^2)$ and $\operatorname{tr} A$, where $A$
is a monomial. Note that $\operatorname{Trace}(A, \bigwedge^2 K^2) = \det A$.

(The Second Fundamental Theorem, dealing with relations between invariants,
was proved by Procesi [P] and Razmyslov [Ra3] in the case Char $K = 0$ and by
Zubkov [Zu] in the case Char $K > 0$.)

Other works on polynomial maps evaluated on matrix algebras include [W] and
[GK], who investigated maps that preserve zeros of multilinear polynomials.

2. Definitions and basic preliminaries 

By $K\langle x_1, \dots, x_m\rangle$ we denote the free $K$-algebra generated by
noncommuting variables $x_1, \dots, x_m$, and refer to the elements of
$K\langle x_1, \dots, x_m\rangle$ as polynomials. Consider any algebra $R$ over
a field $K$. A polynomial $p \in K\langle x_1, \dots, x_m\rangle$ is called a
polynomial identity (PI) of the algebra $R$ if $p(a_1, \dots, a_m) = 0$ for all
$a_1, \dots, a_m \in R$; $p \in K\langle x_1, \dots, x_m\rangle$ is a central
polynomial of $R$ if for any $a_1, \dots, a_m \in R$ one has
$p(a_1, \dots, a_m) \in \operatorname{Cent}(R)$ but $p$ is not a PI of $R$. A
polynomial $p$ (written as a sum of monomials) is called semi-homogeneous of
weighted degree $d$ with (integer) weights $(w_1, \dots, w_m)$ if for each
monomial $h$ of $p$, taking $d_j$ to be the degree of $x_j$ in $h$, we have

\[
d_1 w_1 + \cdots + d_m w_m = d.
\]

A semi-homogeneous polynomial with weights $(1, 1, \dots, 1)$ is called
homogeneous of degree $d$.

A polynomial $p$ is completely homogeneous of multidegree $(d_1, \dots, d_m)$
if each variable $x_i$ appears the same number of times $d_i$ in all monomials.
A polynomial $p \in K\langle x_1, \dots, x_m\rangle$ is called multilinear of
degree $m$ if it is linear in each variable (i.e., completely homogeneous of
multidegree $(1, 1, \dots, 1)$). Thus, a polynomial is multilinear if it is a
polynomial of the form

\[
p(x_1, \dots, x_m) = \sum_{\sigma \in S_m} c_\sigma x_{\sigma(1)} \cdots x_{\sigma(m)},
\]

where $S_m$ is the symmetric group in $m$ letters and the coefficients
$c_\sigma$ are constants in $K$.

We need a slight modification of Amitsur's theorem, which is well known: 

Proposition 1. The algebra of generic matrices with traces is a domain which can 
be embedded in the division algebra UD of central fractions of Amitsur's algebra 
of generic matrices. Likewise, all of the functions in Donkin's theorem can be 
embedded in UD. 

Proof. Any trace function can be expressed as the ratio of two central
polynomials, in view of [Row, Theorem 1.4.12]; also see [BR, Theorem J, p. 27],
which says for any characteristic coefficient $\alpha_k$ of the characteristic
polynomial $\lambda^t + \sum_{k=1}^{t} (-1)^k \alpha_k \lambda^{t-k}$ that

\[
\alpha_k f(a_1, \dots, a_t, r_1, \dots, r_m) = \sum f(T^{k_1} a_1, \dots, T^{k_t} a_t, r_1, \dots, r_m), \tag{2}
\]

summed over all vectors $(k_1, \dots, k_t)$ where each $k_i \in \{0, 1\}$ and
$\sum k_i = k$, where $f$ is any $t$-alternating polynomial (and $t = n^2$). In
particular,

\[
\operatorname{tr}(T) f(a_1, \dots, a_t, r_1, \dots, r_m) = \sum_{k=1}^{t} f(a_1, \dots, a_{k-1}, T a_k, a_{k+1}, \dots, a_t, r_1, \dots, r_m), \tag{3}
\]

so any trace of a polynomial belongs to UD. In general, the function (1) of
Donkin's theorem can be written as a characteristic coefficient, so we can
apply Equation (2). □

Here is one of the main tools for our investigation. 

Definition 1. A cone of $M_n(K)$ is a subset closed under multiplication by
nonzero constants. An invariant cone is a cone invariant under conjugation. An
invariant cone is irreducible if it does not properly contain any nonempty
invariant cone.

Example 2. Examples of invariant cones of $M_n(K)$ include:

(i) The set of diagonalizable matrices.
(ii) The set of non-diagonalizable matrices.
(iii) The set $K$ of scalar matrices.
(iv) The set of nilpotent matrices.
(v) The set $sl_n$ of matrices having trace zero.

3. Images of Polynomials 

For any polynomial $p \in K\langle x_1, \dots, x_m\rangle$, the image of $p$
(in $R$) is defined as

\[
\operatorname{Im} p = \{A \in R : \text{there exist } a_1, \dots, a_m \in R \text{ such that } p(a_1, \dots, a_m) = A\}.
\]

Remark 1. $\operatorname{Im} p$ is invariant under conjugation, since

\[
a\, p(x_1, \dots, x_m)\, a^{-1} = p(a x_1 a^{-1}, a x_2 a^{-1}, \dots, a x_m a^{-1}) \in \operatorname{Im} p,
\]

for any nonsingular $a \in M_n(K)$.

Lemma 1. If Char $K$ does not divide $n$, then any non-identity
$p(x_1, \dots, x_m)$ of $M_n(K)$ must either be a central polynomial or take on
a value which is a matrix whose eigenvalues are not all the same.

Proof. Otherwise $p(x_1, \dots, x_m) - \frac{1}{n}\operatorname{tr}(p(x_1, \dots, x_m))$
is a nilpotent element in the algebra of generic matrices with traces, so by
Proposition 1 it is 0, implying $p$ is central. □

Let us continue with the following easy but crucial lemma. 

Lemma 2. Suppose the field $K$ is closed under $d$-roots. If the image of a
semi-homogeneous polynomial $p$ of weighted degree $d$ intersects an
irreducible invariant cone $C$ nontrivially, then $C \subseteq \operatorname{Im} p$.

Proof. If $A \in \operatorname{Im} p$ then $A = p(x_1, \dots, x_m)$ for some
$x_i \in M_n(K)$. Thus for any nonzero $c \in K$,
$cA = p(c^{w_1/d} x_1, c^{w_2/d} x_2, \dots, c^{w_m/d} x_m) \in \operatorname{Im} p$,
where $(w_1, \dots, w_m)$ are the weights. This shows that
$\operatorname{Im} p$ is a cone. It is invariant by Remark 1, so
$C \cap \operatorname{Im} p$ is a nonempty invariant cone contained in $C$, and
hence equals $C$ by irreducibility. □

Remark 2. When the polynomial $p$ is multilinear, we take the weights
$w_1 = 1$ and $w_i = 0$ for all $i > 1$, and thus do not need any assumption on
$K$ to show that the image of any multilinear polynomial is an invariant cone.
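The scaling step in the proof of Lemma 2 can be illustrated concretely. This is a sketch under stated assumptions: the polynomial $x^2 y + y^2$ with weights $(1, 2)$ and weighted degree $4$ is a hypothetical example, and the integer `root` stands in for $c^{1/d}$, so that $c = \texttt{root}^4$ and no actual root extraction is needed.

```python
# Sketch: for a semi-homogeneous p of weighted degree d with weights (w1, w2),
# p(r**w1 * x, r**w2 * y) = r**d * p(x, y), where r plays the role of c**(1/d).

def mat_mul(a, b):
    """Product of two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def mat_add(a, b):
    """Entrywise sum of two matrices."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def mat_scale(c, a):
    """Scalar multiple c * a."""
    return [[c * x for x in row] for row in a]

WEIGHTS = (1, 2)   # hypothetical weights for (x, y)
DEGREE = 4         # weighted degree: 2*1 + 1*2 = 4 and 0*1 + 2*2 = 4

def p(x, y):
    """Hypothetical semi-homogeneous polynomial x^2 y + y^2."""
    return mat_add(mat_mul(mat_mul(x, x), y), mat_mul(y, y))

# Arbitrary sample matrices (assumptions, chosen only for the check).
x = [[1, 2], [3, 4]]
y = [[0, 1], [5, -2]]
root = 3                      # stands for c**(1/d)
c = root ** DEGREE            # so c = 81

scaled = p(mat_scale(root ** WEIGHTS[0], x), mat_scale(root ** WEIGHTS[1], y))
print(scaled == mat_scale(c, p(x, y)))
```

In the multilinear case of Remark 2 the same computation reduces to $p(c x_1, x_2, \dots, x_m) = c\, p(x_1, \dots, x_m)$, which needs no roots at all.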

Lemma 3. If $\operatorname{Im} p$ consists only of diagonal matrices, then the
image $\operatorname{Im} p$ is either $\{0\}$ or the set $K$ of scalar matrices.

Proof. Suppose that some nonscalar diagonal matrix
$A = \operatorname{Diag}\{\lambda_1, \dots, \lambda_n\}$ is in the image. Then
$\lambda_i \ne \lambda_j$ for some $i$ and $j$. The matrix $A' = A + e_{ij}$
(here $e_{ij}$ is the matrix unit) is conjugate to $A$, so by Remark 1 it also
belongs to $\operatorname{Im} p$. However $A'$ is not diagonal, a
contradiction. □

Lemma 4. Assume that the $x_i$ are matrix units. Then $p(x_1, \dots, x_m)$ is
either $0$, or $c \cdot e_{ij}$ for some $i \ne j$, or a diagonal matrix.

Proof. Suppose that the $x_i$ are matrix units $e_{k_i, l_i}$. Then the product
$x_1 \cdots x_m$ is nonzero if and only if $l_i = k_{i+1}$ for each $i$, and in
this case this product is equal to $e_{k_1, l_m}$. If the $x_i$ are such that
there is at least one $\sigma \in S_m$ such that
$x_{\sigma(1)} \cdots x_{\sigma(m)}$ is nonzero, then we can consider a graph on
$n$ vertices whereby we connect vertex $i$ with vertex $j$ by an oriented edge
if there is a matrix $e_{ij}$ in our set $\{x_1, x_2, \dots, x_m\}$. It can
happen that we will have more than one edge that connects $i$ to $j$, and it is
also possible that we will have edges connecting a vertex to itself. The
evaluation $p(x_1, \dots, x_m) \ne 0$ only if there exists an Eulerian cycle or
an Eulerian path in the graph. This condition is necessary but is not
sufficient. From graph theory we know that there exists an Eulerian path only
if the degrees of all vertices but two are even, and the degrees of these two
vertices are odd. Also we know that there exists an Eulerian cycle only if the
degrees of all vertices are even. Thus when $p(x_1, \dots, x_m) \ne 0$, there
exists either an Eulerian path or cycle in the graph. In the first case we have
exactly two vertices of odd degree such that one of them ($i$) has more output
edges and another ($j$) has more input edges. Thus the only nonzero terms in
the sum of our polynomial can be of the type $c\, e_{ij}$, and therefore the
result will also be of this type. In the second case all degrees are even. Thus
there are only cycles and the result must be a diagonal matrix. □
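Lemma 4 lends itself to a brute-force check in a small case. The sketch below assumes $n = 2$ and $m = 3$ and uses a hypothetical multilinear polynomial whose six coefficients are $1, \dots, 6$; it evaluates the polynomial on every triple of matrix units and records which of the three shapes from the lemma occurs. The names `classify`, `evaluate`, and `kinds` are illustrative only.

```python
# Sketch: evaluate a multilinear polynomial on all triples of 2 x 2 matrix
# units and classify each value as zero, diagonal, or c * e_ij with i != j.
from itertools import permutations, product

N = 2  # matrix size (assumption for this small check)
M = 3  # number of variables (assumption for this small check)

def mat_mul(a, b):
    """Product of two N x N matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(N)) for j in range(N)]
            for i in range(N)]

def unit(i, j):
    """Matrix unit e_ij (0-based indices)."""
    return [[int((r, c) == (i, j)) for c in range(N)] for r in range(N)]

def evaluate(coeffs, mats):
    """Evaluate sum over sigma of c_sigma * x_sigma(1) ... x_sigma(m)."""
    total = [[0] * N for _ in range(N)]
    for perm, c in coeffs.items():
        prod = [[int(i == j) for j in range(N)] for i in range(N)]
        for idx in perm:
            prod = mat_mul(prod, mats[idx])
        total = [[t + c * q for t, q in zip(rt, rq)] for rt, rq in zip(total, prod)]
    return total

def classify(a):
    """Name the shape of the matrix a."""
    nonzero = [(i, j) for i in range(N) for j in range(N) if a[i][j] != 0]
    if not nonzero:
        return "zero"
    if all(i == j for i, j in nonzero):
        return "diagonal"
    if len(nonzero) == 1:
        return "off-diagonal unit multiple"
    return "other"

# Hypothetical coefficients 1..6, one per permutation in S_3.
coeffs = {perm: k + 1 for k, perm in enumerate(permutations(range(M)))}
units = [unit(i, j) for i in range(N) for j in range(N)]
kinds = {classify(evaluate(coeffs, list(t))) for t in product(units, repeat=M)}
print(sorted(kinds))
```

The shape "other" never appears, in line with the graph argument: the multiset of edges fixes the start and end vertices of every Eulerian path, so all nonzero monomials land in the same matrix position unless they are closed cycles.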

As mentioned earlier, the following result follows easily from [Her], with
another proof given in [BK], but a self-contained proof is included here for
completeness.

Lemma 5. If the image of $p$ is not $\{0\}$ or the set of scalar matrices, then
for any $i \ne j$ the matrix unit $e_{ij}$ belongs to $\operatorname{Im} p$. The
linear span $L$ of $\operatorname{Im} p$ must be either $\{0\}$, $K$, $sl_n$,
or $M_n(K)$.

Proof. Assume that the image is neither $\{0\}$ nor the set of scalar matrices.
Then by Lemma 3 the image contains a nondiagonal matrix
$p(x_1, \dots, x_m) = A$. Any $x_i$ is a linear combination of matrix units.
After opening brackets on the left hand side we will have a linear combination
of evaluations of $p$ on matrix units, and on the right hand side a nondiagonal
matrix. From Lemma 4 it follows that any evaluation of $p$ on matrix units is
either diagonal or a matrix unit multiplied by some coefficient. Thus there is
a matrix $e_{ij}$ for $i \ne j$ in $\operatorname{Im} p$. Since any nondiagonal
$e_{kl}$ is conjugate to $e_{ij}$, all nondiagonal matrix units belong to the
image. Thus all matrices with zeros on the diagonal belong to the linear span
of the image. Taking matrices conjugate to these, we obtain
$sl_n \subseteq L$. Thus $L$ must be either $sl_n$ or $M_n$. □

3.1. The case $M_2(K)$. Now we consider the case $n = 2$. We start by
introducing the cones of main interest to us, drawing from Example 2.

Example 3.

(i) The set of nonzero nilpotent matrices comprises an irreducible invariant
cone, since these all have the same minimal and characteristic polynomial
$x^2$.

(ii) The set of nonzero scalar matrices is an irreducible invariant cone.

(iii) Let $\tilde K$ denote the set of non-nilpotent, non-diagonalizable
matrices in $M_2(K)$. Note that $A \in \tilde K$ precisely when $A$ is
non-scalar, but with equal nonzero eigenvalues, which is the case if and only
if $A$ is the sum of a nonzero scalar matrix with a nonzero nilpotent matrix.
These are all conjugate when the scalar part is the identity, i.e., for
matrices of the form

\[
\begin{pmatrix} 1 & a \\ 0 & 1 \end{pmatrix}, \qquad a \ne 0,
\]

since these all have the same minimal and characteristic polynomials, namely
$x^2 - 2x + 1$. It follows that $\tilde K$ is an irreducible invariant cone.

(iv) Let $\hat K$ denote the set of non-nilpotent matrices in $M_2(K)$ that
have trace zero.

When Char $K \ne 2$, $\hat K$ is an irreducible invariant cone, since any such
matrix has distinct eigenvalues and thus is conjugate to

\[
\begin{pmatrix} \lambda & 0 \\ 0 & -\lambda \end{pmatrix}.
\]

When Char $K = 2$, $\hat K$ is an irreducible invariant cone, since any such
matrix [...]

(v) $sl_2(K) \setminus \{0\}$ is the union of the two irreducible invariant
cones of (i) and (iv). (The cases Char $K \ne 2$ and Char $K = 2$ are treated
separately.)

(vi) Let $C$ denote the set of nonzero matrices which are the sum of a scalar
and a nilpotent matrix. Then $C$ is the union of the following three
irreducible invariant cones: the nonzero scalar matrices, the nonzero nilpotent
matrices, and the nonzero scalar multiples of non-identity unipotent matrices.
(All non-identity unipotent matrices are conjugate.)

From now on, we assume that $K$ is a quadratically closed field. In particular,
all of the eigenvalues of a matrix $A \in M_2(K)$ lie in $K$. One of our main
ideas is to consider some invariant of the matrices in $\operatorname{Im} p$,
and study the corresponding invariant cones. Here is the first such invariant
that we consider.

Remark 3. Any non-nilpotent $2 \times 2$ matrix $A$ over a quadratically closed
field has two eigenvalues $\lambda_1$ and $\lambda_2$ such that at least one of
them is nonzero. Therefore one can define the ratio of eigenvalues, which is
well-defined up to taking reciprocals: $\lambda_1/\lambda_2$ and
$\lambda_2/\lambda_1$. Thus, we will say that two non-nilpotent matrices have
different ratios of eigenvalues if and only if their ratios of eigenvalues are
neither equal nor reciprocal. We do have a well-defined mapping
$\Pi : M_2(K) \to K$ given by
$A \mapsto \frac{\lambda_1}{\lambda_2} + \frac{\lambda_2}{\lambda_1}$. This
mapping is algebraic because

\[
\frac{\lambda_1}{\lambda_2} + \frac{\lambda_2}{\lambda_1} = -2 + \frac{(\operatorname{tr} A)^2}{\det A}.
\]
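The identity in Remark 3 follows from $\lambda_1^2 + \lambda_2^2 = (\lambda_1 + \lambda_2)^2 - 2\lambda_1\lambda_2$. As a quick sanity check, the sketch below computes both sides for one hypothetical invertible matrix and for a conjugate of it, using exact rational arithmetic; the function names and the sample matrices are assumptions made for illustration.

```python
# Sketch: Pi(A) = l1/l2 + l2/l1 = -2 + tr(A)^2 / det(A) for an invertible
# 2 x 2 matrix, and Pi is unchanged by conjugation.
from fractions import Fraction

def mat_mul(a, b):
    """Product of two 2 x 2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def trace(a):
    return a[0][0] + a[1][1]

def det(a):
    return a[0][0] * a[1][1] - a[0][1] * a[1][0]

def ratio_invariant(a):
    """Pi(A) computed from trace and determinant (A must be invertible)."""
    return Fraction(trace(a) ** 2, det(a)) - 2

def from_eigenvalues(l1, l2):
    """Pi computed directly from two nonzero eigenvalues."""
    return Fraction(l1, l2) + Fraction(l2, l1)

a = [[2, 5], [0, 3]]          # upper triangular, eigenvalues 2 and 3
g = [[1, 1], [0, 1]]          # an invertible matrix
g_inv = [[1, -1], [0, 1]]     # its inverse
b = mat_mul(mat_mul(g, a), g_inv)   # a conjugate of a

print(ratio_invariant(a), from_eigenvalues(2, 3), ratio_invariant(b))
```

Swapping the two eigenvalues leaves `from_eigenvalues` unchanged, which is the sense in which the ratio is only defined up to taking reciprocals.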

Remark 4. The set of non-scalar diagonalizable matrices with a fixed nonzero
ratio $r$ of eigenvalues (up to taking reciprocals) is an irreducible invariant
cone. Indeed, this is true since any such diagonalizable matrix is conjugate to

\[
\lambda \begin{pmatrix} 1 & 0 \\ 0 & r \end{pmatrix}.
\]

3.2. Images of semi-homogeneous polynomials. We are ready to prove Theorem 1.

Lemma 6. Suppose $K$ is closed under $d$-roots, as well as being quadratically
closed. If the image $\operatorname{Im} p$ of a semi-homogeneous polynomial $p$
of weighted degree $d$ contains an element of $\tilde K$ (the set of
non-nilpotent, non-diagonalizable matrices of Example 3(iii)), then
$\operatorname{Im} p$ contains all of $\tilde K$.

Proof. This is clear from Lemma 2 together with Example 3(iii), since
$\tilde K$ is an irreducible invariant cone. □

Proof of Theorem 1. Assume that there are matrices $p(x_1, \dots, x_m)$ and
$p(y_1, \dots, y_m)$ with different ratios of eigenvalues in the image of $p$.
Consider the polynomial matrix
$f(t) = p(t x_1 + (1 - t) y_1, t x_2 + (1 - t) y_2, \dots, t x_m + (1 - t) y_m)$,
and $\Pi \circ f$, where $\Pi$ is defined in Remark 3. Write the nonconstant
rational function $(\operatorname{tr} f)^2/\det f$ in lowest terms as
$A(t)/B(t)$, where $A(t), B(t)$ are polynomials of degree $\le 2 \deg p$.

An element $c \in K$ is in the image of $A/B$ iff there exists $t$ such that
$A(t) - cB(t) = 0$. (If $A(t^*) - cB(t^*) = 0$ for some $t^*$, then
$B(t^*) \ne 0$, since otherwise $t^*$ would be a common root of $A$ and $B$.)
Let $d_c = \deg(A - cB)$. Then $d_c \le \max(\deg A, \deg B) \le 2 \deg p$, and
$d_c = \max(\deg A, \deg B)$ for almost all $c$. Hence, for almost all $c$, the
polynomial $A - cB$ is not constant and thus has a root in $K$. Thus the image
of $A/B$ is Zariski dense in $K$, and hence so is the set of ratios of
eigenvalues attained on $\operatorname{Im} p$.

Hence, we may assume that $\operatorname{Im} p$ consists only of matrices
having a fixed ratio $r$ of eigenvalues. If $r \ne \pm 1$, the eigenvalues
$\lambda_1$ and $\lambda_2$ are linear functions of
$\operatorname{tr} p(x_1, \dots, x_m)$. Hence $\lambda_1$ and $\lambda_2$ are
elements of the algebra of generic matrices with traces, which is a domain by
Proposition 1. But the two nonzero elements $p - \lambda_1 I$ and
$p - \lambda_2 I$ have product zero, a contradiction.

We conclude that $r = \pm 1$. First assume $r = 1$. If Char $K \ne 2$, then $p$
is a PI, by Lemma 1. If Char $K = 2$ then the image is either $sl_2(K)$ or
$\hat K$ (the set of non-nilpotent matrices of trace zero), by Example 3(v).

Thus, we may assume $r = -1$ and Char $K \ne 2$. Hence, $\operatorname{Im} p$
consists only of matrices with $\lambda_1 = -\lambda_2$. By Lemma 1 there is a
non-nilpotent matrix in the image of $p$. Hence, by Example 3(v),
$\operatorname{Im} p$ is either $\hat K$ or strictly contains it and is all of
$sl_2(K)$. □

We illuminate this result with some examples to show that certain cones may be 
excluded. 

Example 4.

(i) The polynomial $g(x_1, x_2) = [x_1, x_2]^2$ has the property that
$g(A, B) = 0$ whenever $A$ is scalar, but $g$ can take on a nonzero value
whenever $A$ is non-scalar. Thus, $g(x_1, x_2) x_1$ takes on all values except
scalars. This polynomial is completely homogeneous, but not multilinear. (One
can linearize in $x_2$ to make $g$ linear in each variable except $x_1$, and
the same idea can be applied to Formanek's construction [F1] of a central
polynomial for any $n$.)

(ii) Let $S$ be any finite subset of $K$. There exists a completely homogeneous
polynomial $p$ such that $\operatorname{Im} p$ is the set of all $2 \times 2$
matrices except the matrices with ratio of eigenvalues from $S$. The
construction is as follows. Consider

\[
f(x) = x \cdot \prod_{\delta \in S} (\lambda_1 - \lambda_2 \delta)(\lambda_2 - \lambda_1 \delta),
\]

where $\lambda_1, \lambda_2$ are the eigenvalues of $x$. For each $\delta$ the
product $(\lambda_1 - \lambda_2 \delta)(\lambda_2 - \lambda_1 \delta)$ is a
polynomial in $\operatorname{tr} x$ and $\operatorname{tr} x^2$. Thus $f(x)$ is
a polynomial with traces, and, as noted above (by [Row, Theorem 1.4.12]), one
can rewrite each trace in $f$ as a fraction of multilinear central polynomials
(see (3) in Proposition 1). After that we multiply the expression by the
product of all the denominators, which we can take to have value 1. We obtain a
completely homogeneous polynomial $p$ whose image is the cone under
$\operatorname{Im} f$ and thus equals $\operatorname{Im} f$. The image of $p$
is the set of all non-nilpotent matrices with ratios of eigenvalues not
belonging to $S$.

(iii) The image of a completely homogeneous polynomial evaluated on
$2 \times 2$ matrices can also be $\hat K$, the set of non-nilpotent matrices
of trace zero. Take $f(x, y) = [x, y]^3$. This is the product of $[x, y]^2$ and
$[x, y]$. Since $[x, y]^2$ is a central polynomial, $\operatorname{tr} f = 0$.
However, there are no nonzero nilpotent matrices in $\operatorname{Im} f$,
because if $[A, B]^3$ is nilpotent then $[A, B]$ (which is a scalar multiple of
$[A, B]^3$) is nilpotent, and therefore $[A, B]^2 = 0$ and $[A, B]^3 = 0$.

(iv) Consider the polynomial

\[
p(x_1, x_2, y_1, y_2) = [(x_1 x_2)^2, (y_1 y_2)^2]^2 + [(x_1 x_2)^2, (y_1 y_2)^2]\,[x_1 y_1, x_2 y_2]^2.
\]

Then $p$ takes on all scalar values (since it becomes central by specializing
$x_1 \mapsto x_2$ and $y_1 \mapsto y_2$), but also takes on all nilpotent
values, since specializing $x_1 \mapsto I + e_{12}$, $x_2 \mapsto e_{22}$,
$y_1 \mapsto e_{12}$, and $y_2 \mapsto e_{21}$ sends $p$ to

\[
[(e_{12} + e_{22})^2, e_{11}]^2 + [(e_{12} + e_{22})^2, e_{11}]\,[e_{12}, e_{21}]^2 = -e_{12}(e_{11} - e_{22})^2 = -e_{12}.
\]

We claim that $\operatorname{Im} p$ does not contain any matrix
$a = p(x_1, x_2, y_1, y_2)$ in $\tilde K$, the set of non-nilpotent,
non-diagonalizable matrices. Otherwise, the matrix
$[(x_1 x_2)^2, (y_1 y_2)^2]\,[x_1 y_1, x_2 y_2]^2$ would be the difference of a
matrix having equal eigenvalues and a scalar matrix, but of trace 0, and so
would have both eigenvalues 0 and thus be nilpotent. Thus
$[(x_1 x_2)^2, (y_1 y_2)^2]$ would also be nilpotent, implying the scalar term
$[(x_1 x_2)^2, (y_1 y_2)^2]^2$ equals zero, implying $a$ is nilpotent, a
contradiction.

$\operatorname{Im} p$ also contains all matrices having two distinct
eigenvalues. We conclude that $\operatorname{Im} p = M_2(K) \setminus \tilde K$.
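Two computations from Example 4 can be replayed mechanically: the fact used in (i) and (iii) that the square of a commutator of $2 \times 2$ matrices is scalar (so its cube has trace zero), and the specialization in (iv). The sketch below is an illustration only; the helper names and the sample matrices `x`, `y` are hypothetical, and the polynomial of (iv) is coded with the exponents as displayed there.

```python
# Sketch: [x, y]^2 is scalar on 2 x 2 matrices, [x, y]^3 has trace zero,
# and the polynomial of Example 4(iv) takes a nonzero nilpotent value.

def mat_mul(a, b):
    """Product of two 2 x 2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def mat_add(a, b):
    """Entrywise sum of two matrices."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def mat_sub(a, b):
    """Entrywise difference of two matrices."""
    return [[x - y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def mat_pow(a, k):
    """k-th power of a, for k >= 1."""
    result = a
    for _ in range(k - 1):
        result = mat_mul(result, a)
    return result

def comm(a, b):
    """Commutator [a, b] = ab - ba."""
    return mat_sub(mat_mul(a, b), mat_mul(b, a))

def trace(a):
    return a[0][0] + a[1][1]

def p(x1, x2, y1, y2):
    """The polynomial of Example 4(iv)."""
    first = comm(mat_pow(mat_mul(x1, x2), 2), mat_pow(mat_mul(y1, y2), 2))
    second = comm(mat_mul(x1, y1), mat_mul(x2, y2))
    return mat_add(mat_pow(first, 2), mat_mul(first, mat_pow(second, 2)))

# Arbitrary sample matrices (assumptions) for the commutator checks.
x = [[1, 2], [3, 4]]
y = [[0, 1], [1, 0]]
c2 = mat_pow(comm(x, y), 2)
c3 = mat_pow(comm(x, y), 3)

# The specialization of (iv): x1 = I + e12, x2 = e22, y1 = e12, y2 = e21.
value = p([[1, 1], [0, 1]], [[0, 0], [0, 1]], [[0, 1], [0, 0]], [[0, 0], [1, 0]])

print(c2, trace(c3), value)
```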

Remark 5. In Example 4(iv), the intersection $S$ of $\operatorname{Im} p$ with
the discriminant surface is defined by the polynomial
$\operatorname{tr}(p(x_1, \dots, x_m))^2 - 4 \det(p(x_1, \dots, x_m)) = (\lambda_1 - \lambda_2)^2$.
$S$ is the union of two irreducible varieties (its scalar matrices and the
nonzero nilpotent matrices), and thus $S$ is a reducible variety. Thus, we see
that the discriminant surface of a polynomial $p$ of the algebra of generic
matrices can be reducible, even if it is not divisible by any trace polynomial.
Such an example could not exist for $p$ multilinear, since then, by the same
sort of argument as given in the proof of Theorem 1, the discriminant surface
would give a generic zero divisor in Amitsur's universal division algebra UD of
Proposition 1, a contradiction. In fact, we will also see that the image of a
multilinear polynomial cannot be as in Example 4(iv).

3.3. Images of non-homogeneous polynomials. Now we consider briefly the general
case. One can write any polynomial $p(x_1, \dots, x_m)$ as
$p = h_k + \cdots + h_n$, where the $h_i$ are semi-homogeneous polynomials of
weighted degree $i$.

Proposition 2. Notation as above, assume that there are weights
$(w_1, \dots, w_m)$ such that the polynomial $h_n$ has image dense in $M_2(K)$.
Then $\operatorname{Im} p$ is dense in $M_2(K)$.

Proof. Consider

\[
p(\lambda^{w_1} x_1, \dots, \lambda^{w_m} x_m) = \sum_{i=k}^{n} h_i \lambda^i.
\]

One can write $P = \lambda^{-n} p(\lambda^{w_1} x_1, \dots, \lambda^{w_m} x_m)$
as a polynomial in $x_1, \dots, x_m$ and $\varepsilon = 1/\lambda$. The matrix
polynomial $P$ is the set of four polynomials
$P_{1,1}, P_{1,2}, P_{2,1}, P_{2,2}$, which we claim are independent. If there
is some polynomial $h$ in four variables such that
$h(P_{1,1}, P_{1,2}, P_{2,1}, P_{2,2}) = 0$, then $h$ should vanish on the four
entries of $P$ for each $\varepsilon$, in particular for $\varepsilon = 0$,
where $P = h_n$, contradicting the density of the image of $h_n$. □

Remark 6. The case remains open where $p(x_1, \dots, x_m)$ is a polynomial for
which there are no weights $(w_1, \dots, w_m)$ such that one can write
$p = h_k + \cdots + h_n$, where $h_i$ is semi-homogeneous of weighted degree
$i$ and $h_n$ has image dense in $M_2$.

Example 5. For Char $K \ne 2$ we give an example of such a polynomial whose
middle term has image dense in $M_2(K)$. Take the polynomial
$f(x, y) = [x, y] + [x, y]^2$. It is not hard to check that
$\operatorname{Im} f$ is the set of all matrices with eigenvalues $c^2 + c$ and
$c^2 - c$. Consider
$p(\alpha_1, \alpha_2, \beta_1, \beta_2) = f(\alpha_1 + \beta_1^2, \alpha_2 + \beta_2^2)$.
The polynomials $f$ and $p$ have the same images. Now let us open the brackets.
The term of degree 4 is $h_4 = [\alpha_1, \alpha_2]^2 + [\beta_1^2, \beta_2^2]$.
The image of $h_4$ is all of $M_2(K)$, because $[\alpha_1, \alpha_2]^2$ can be
any scalar matrix and $[\beta_1^2, \beta_2^2]$ can be any trace zero matrix.
However the image of $p$ is the set of all matrices with eigenvalues $c^2 + c$
and $c^2 - c$.
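The eigenvalue claim in Example 5 can be tested without extracting square roots. If $C = [x, y]$ has eigenvalues $\pm c$, then $C^2 = c^2 I$, so $f = C + C^2$ has eigenvalues $c^2 \pm c$, which is equivalent to $\operatorname{tr} f = 2s$ and $\det f = s^2 - s$ with $s = c^2 = -\det C$. The sketch below checks these two relations for one hypothetical pair of integer matrices; the helper names are illustrative.

```python
# Sketch: for f(x, y) = [x, y] + [x, y]^2 on 2 x 2 matrices, with
# s = -det([x, y]), one has tr f = 2s and det f = s^2 - s.

def mat_mul(a, b):
    """Product of two 2 x 2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def mat_add(a, b):
    """Entrywise sum of two matrices."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def mat_sub(a, b):
    """Entrywise difference of two matrices."""
    return [[x - y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def comm(a, b):
    """Commutator [a, b] = ab - ba."""
    return mat_sub(mat_mul(a, b), mat_mul(b, a))

def trace(a):
    return a[0][0] + a[1][1]

def det(a):
    return a[0][0] * a[1][1] - a[0][1] * a[1][0]

def f(x, y):
    """f(x, y) = [x, y] + [x, y]^2."""
    c = comm(x, y)
    return mat_add(c, mat_mul(c, c))

# Arbitrary sample matrices (assumptions, chosen only for the check).
x = [[1, 2], [3, 4]]
y = [[0, 1], [1, 0]]
s = -det(comm(x, y))      # plays the role of c^2
value = f(x, y)

print(trace(value) == 2 * s, det(value) == s * s - s)
```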

3.4. Images of multilinear polynomials. 

Lemma 7. If $A, B \in \operatorname{Im} p$ have different ratios of
eigenvalues, then $\operatorname{Im} p$ contains matrices having arbitrary
ratios of eigenvalues $\lambda_1/\lambda_2 \in K$.

Proof. If $A = p(x_1, \dots, x_m)$, $B = p(y_1, \dots, y_m) \in \operatorname{Im} p$
have different ratios of eigenvalues, then we can lift the
$x_1, \dots, x_m, y_1, \dots, y_m$ to generic matrices, and then
$p(x_1, \dots, x_m) = A$ and $p(y_1, \dots, y_m) = B$ also have different
ratios of eigenvalues. Then take

\[
f(T_1, T_2, \dots, T_m) = p(t_1 x_1 + \tau_1 y_1, \dots, t_m x_m + \tau_m y_m),
\]

where $T_i = (t_i, \tau_i) \in K^2$. The polynomial $f$ is linear with respect
to all $T_i$. In view of Remark 2 it is enough to show that the ratio
$(\operatorname{tr} f)^2/\det f$ takes on all values.

Fix $T_1, \dots, T_{i-1}, T_{i+1}, \dots, T_m$ to be generic pairs, where $i$ is
such that $(\operatorname{tr} f)^2/\det f$ is not constant with respect to
$T_i$. Such $i$ exists because otherwise all matrices in the image (in
particular, $A$ and $B$) have the same ratio of eigenvalues. But
$(\operatorname{tr} f)^2/\det f$ is the ratio of quadratic polynomials, and $K$
is quadratically closed.

If there is a point $T_i$ such that $\operatorname{tr} f = \det f = 0$, then
$f$ evaluated at this $T_i$ is nilpotent. Since $\operatorname{tr} f$ is a
linear function, the equation $\operatorname{tr} f = 0$ has only one root,
which is a rational function of the other parameters. Thus $f$ evaluated at
this $T_i$ is 0, by Amitsur's Theorem. We conclude that the ratio of
eigenvalues does not depend on $T_i$, contrary to our assumption on $i$. Hence,
we can solve $(\operatorname{tr} f)^2/\det f = c$ for any $c \in K$. □

Lemma 8. If there exist $\lambda_1 \ne \pm\lambda_2$ with a collection of
matrices $(E_1, E_2, \dots, E_m)$ such that $p(E_1, E_2, \dots, E_m)$ has
eigenvalues $\lambda_1$ and $\lambda_2$, then all diagonalizable matrices lie
in $\operatorname{Im} p$.

Proof. Applying Lemma 2 to the hypothesis, there is a matrix

\[
\begin{pmatrix} \lambda_1 & 0 \\ 0 & \lambda_2 \end{pmatrix} \in \operatorname{Im} p, \qquad \lambda_1 \ne \pm\lambda_2,
\]

which is an evaluation of $p$ on matrix units $e_{ij}$. Consider the following
mapping $\chi$ acting on the indices of the matrix units:
$\chi(e_{ij}) = e_{3-i, 3-j}$. Now take the polynomial

\[
f(T_1, T_2, \dots, T_m) = p(t_1 x_1 + \tau_1 \chi(x_1), \dots, t_m x_m + \tau_m \chi(x_m)),
\]

where $T_i = (t_i, \tau_i) \in K^2$, which is linear with respect to each
$T_i$. Let us open the brackets. We obtain $2^m$ terms, and for each of them
the degrees of all vertices stay even. (The edge 12 becomes 21, which does not
change degrees, and the edge 11 becomes 22, which decreases the degree of the
vertex 1 by two and increases the degree of the vertex 2 by two.) Thus all
terms remain diagonal. Consider generic pairs $T_1, \dots, T_m \in K^2$. For
each $i$ consider the polynomial
$f_i(T^*) = f(T_1, \dots, T_{i-1}, T_i + T^*, T_{i+1}, \dots, T_m)$. For at
least one $i$ the ratio of eigenvalues of $f_i$ must be different from $\pm 1$.
(Otherwise the ratio of eigenvalues of $f_i$ equals $\pm 1$ for all $i$,
implying $\lambda_1 = \pm\lambda_2$, a contradiction.)

Fix $i$ such that the ratio of eigenvalues of $f_i$ is not $\pm 1$. By
linearity, $\operatorname{Im}(f_i)$ takes on values with all possible ratios of
eigenvalues; hence, the cone under $\operatorname{Im}(f_i)$ is the set of all
diagonal matrices. Therefore by Lemma 2 all diagonalizable matrices lie in the
image of $p$. □

Lemma 9. If $p$ is a multilinear polynomial evaluated on the matrix ring
$M_2(K)$, then $\operatorname{Im} p$ is either $\{0\}$, $K$, $sl_2$, $M_2(K)$,
or $M_2(K) \setminus \tilde K$, where $\tilde K$ is the set of non-nilpotent,
non-diagonalizable matrices.

Proof. In view of Lemma 2, we are done unless $\operatorname{Im} p$ contains a
non-scalar matrix. By Lemma 5 the linear span of $\operatorname{Im} p$ is
$sl_2$ or $M_2(K)$. We treat the characteristic 2 and characteristic $\ne 2$
cases separately.

CASE I: Char $K = 2$. Consider the set

\[
Q = \{p(E_1, \dots, E_m) \text{ where the } E_j \text{ are matrix units}\}.
\]

If the linear span of the image is not $sl_2$, then $Q$ contains at least one
non-scalar diagonal matrix $\operatorname{Diag}\{\lambda_1, \lambda_2\}$, so
$\lambda_1 \ne -\lambda_2$ (since $+1 = -1$). Hence by Lemma 8 all
diagonalizable matrices belong to $\operatorname{Im} p$. Thus,
$\operatorname{Im} p$ contains $M_2(K) \setminus \tilde K$.

If the linear span of the image of $p$ is $sl_2$, then by Lemma 4 the identity
matrix (and thus all scalar matrices) and $e_{12}$ (and thus all nilpotent
matrices) belong to the image. On the other hand, in characteristic 2, any
matrix in $sl_2$ is conjugate to a matrix of the form
$\lambda_1 I + \lambda_2 e_{12}$, and we consider the invariant
$\lambda_2/\lambda_1$. Take $x_1, \dots, x_m$ to be generic matrices. If
$p(x_1, \dots, x_m)$ were nilpotent then $\operatorname{Im} p$ would consist
only of nilpotent matrices, which is impossible. By Example 3(v),
$p(x_1, \dots, x_m)$ is not scalar and not nilpotent, and thus is a matrix from
$\tilde K$. Hence, $\tilde K \subseteq \operatorname{Im} p$, by Lemma 6. Thus,
all trace zero matrices belong to $\operatorname{Im} p$.

CASE II: Char $K \neq 2$. Again assume that the image is not $\{0\}$ or the set of
scalar matrices. Then by Lemma 5 we obtain that $e_{12} \in \operatorname{Im} p$. Thus all nilpotent
matrices lie in $\operatorname{Im} p$. If the image consists only of matrices of trace zero, then by
Lemma 5 there is at least one matrix in the image with a nonzero diagonal entry.
By Lemma 2 there is a set of matrix units that maps to a nonzero diagonal matrix
which, by assumption, is of trace zero and thus is $\begin{pmatrix} c & 0 \\ 0 & -c \end{pmatrix}$. By Lemma 2 and
Example ini, $\operatorname{Im} p$ contains all trace zero $2 \times 2$ matrices.

Assume that the image contains a matrix with nonzero trace. Then by Lemma 5
the linear span of the image is $M_2(K)$, and together with Lemma 3 we have at least
two linearly independent diagonal matrices in the image. Either these matrices have
ratios of eigenvalues $(\lambda_1 : \lambda_2)$ and $(\lambda_2 : \lambda_1)$ for $\lambda_1 \neq \pm\lambda_2$, or these matrices have
non-equivalent ratios. In the first case we can use Lemma 8, which says that all
diagonalizable matrices lie in the image. In the second case, if at least one of these
matrices has ratio not equal to $\pm 1$, then we also use Lemma 8 and obtain that
all diagonalizable matrices lie in the image. If these matrices are such that the
ratios of their eigenvalues are respectively $1$ and $-1$, then we use Lemma 7 and
obtain that all diagonalizable matrices with distinct eigenvalues lie in the image.
By assumption, in this case, scalar matrices also belong to the image. Therefore we
obtain that for any ratio $(\lambda_1 : \lambda_2)$ there is a matrix $A \in \operatorname{Im} p$ having such a ratio
of eigenvalues. Using Lemmas 2 and [U we obtain that the image of $p$ can be either
$\{0\}$, $K$, $\mathrm{sl}_2$, $M_2(K)$, or $M_2(K) \setminus \tilde{K}$. $\square$

Lemma 10. If $p$ is a multilinear polynomial evaluated on the matrix ring $M_2(K)$,
where $K$ is a quadratically closed field of characteristic 2, then $\operatorname{Im} p$ is either $\{0\}$,
$K$, $\mathrm{sl}_2$, or $M_2(K)$.

Proof. In view of Lemma 9 it suffices to assume that the image of $p$ is $M_2(K) \setminus \tilde{K}$.
Let $x_1, \dots, x_m, y_1, \dots, y_m$ be generic matrices. Consider the polynomials

$$b_i = p(x_1, \dots, x_{i-1}, y_i, x_{i+1}, \dots, x_m).$$

Let $p_i(x_1, \dots, x_m, y_i) = p \operatorname{tr}(b_i) + \operatorname{tr}(p)\, b_i$. Hence $p_i$ can be written as

$$p_i = p(x_1, \dots, x_{i-1}, x_i \operatorname{tr}(b_i) + y_i \operatorname{tr}(p), x_{i+1}, \dots, x_m).$$

Therefore $\operatorname{Im} p_i \subseteq \operatorname{Im} p$. Also if $a \in \operatorname{Im} p_i$, then

$$\operatorname{tr}(a) = \operatorname{tr}(p \operatorname{tr}(b_i) + \operatorname{tr}(p)\, b_i) = 2 \operatorname{tr}(p) \operatorname{tr}(b_i) = 0.$$

Thus, $\operatorname{Im} p_i$ consists only of trace-zero matrices which belong to the image of $p$.
Excluding $\tilde{K}$, the only trace zero matrices are nilpotent or scalar. Thus, for each $i$,
$p_i(x_1, \dots, x_m, y_i)$ is either scalar or nilpotent. However, the $p_i$ are elements of
the algebra of free matrices with traces, which is a domain. Thus, $p_i(x_1, \dots, x_m, y_i)$
cannot be nilpotent. Hence for all $i = 1, \dots, m$, $p_i(x_1, \dots, x_m, y_i)$ is scalar. In this
case, changing variables leaves the plane $\langle p, I \rangle$ invariant. Therefore, $\dim(\operatorname{Im} p) = 2$,
a contradiction. $\square$
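
The rewriting of $p_i$ as a single evaluation of $p$ uses only multilinearity in the $i$-th slot, with the traces treated as scalar coefficients. A short sketch of that step, in the notation of the proof (the abbreviated argument lists are an illustrative shorthand, not the authors' notation), could read:

```latex
\begin{align*}
p(x_1,\dots,x_{i-1},\; x_i\operatorname{tr}(b_i) + y_i\operatorname{tr}(p),\; x_{i+1},\dots,x_m)
 &= \operatorname{tr}(b_i)\,p(x_1,\dots,x_i,\dots,x_m)
  + \operatorname{tr}(p)\,p(x_1,\dots,y_i,\dots,x_m) \\
 &= \operatorname{tr}(b_i)\,p + \operatorname{tr}(p)\,b_i , \\
\operatorname{tr}\bigl(\operatorname{tr}(b_i)\,p + \operatorname{tr}(p)\,b_i\bigr)
 &= \operatorname{tr}(b_i)\operatorname{tr}(p) + \operatorname{tr}(p)\operatorname{tr}(b_i)
  = 2\operatorname{tr}(p)\operatorname{tr}(b_i) = 0
  \qquad (\operatorname{char} K = 2).
\end{align*}
```

The last equality is the only place where characteristic 2 enters this computation.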

Lemma 11. If $p$ is a multilinear polynomial evaluated on the matrix ring $M_2(K)$
(where $K$ is a quadratically closed field of characteristic not 2), then $\operatorname{Im} p$ is either
$\{0\}$, $K$, $\mathrm{sl}_2$, or $M_2(K)$.

Remark. Since the details are rather technical, we start by sketching the proof.
We assume that $\operatorname{Im} p = M_2(K) \setminus \tilde{K}$. The linear change of the variable in position
$i$ gives us the line $A + tB$ in the image, where $A = p(x_1, \dots, x_m)$ and
$B = p(x_1, \dots, x_{i-1}, y_i, x_{i+1}, \dots, x_m)$. Take the function that maps $t$ to $f(t) = (\lambda_1 - \lambda_2)^2$,
where $\lambda_i$ are the eigenvalues of $A + tB$. Evidently

$$f(t) = (\lambda_1 - \lambda_2)^2 = (\lambda_1 + \lambda_2)^2 - 4\lambda_1\lambda_2 = (\operatorname{tr}(A + tB))^2 - 4\det(A + tB),$$

so our function $f$ is a polynomial of degree $\le 2$ evaluated on the entries of $A + tB$, and
thus is a polynomial in $t$.
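
As a sketch of why $f$ has degree at most 2 in $t$, one can expand the trace and determinant of $A + tB$ directly. The expansion of $\det(A + tB)$ used below is the standard identity for $2 \times 2$ matrices; it is not stated in the text and is brought in here as an assumption for illustration.

```latex
\begin{align*}
\operatorname{tr}(A+tB) &= \operatorname{tr}A + t\operatorname{tr}B,\\
\det(A+tB) &= \det A + t\bigl(\operatorname{tr}A\operatorname{tr}B - \operatorname{tr}(AB)\bigr) + t^2\det B,\\
f(t) &= \bigl(\operatorname{tr}^2 A - 4\det A\bigr)
      + 2t\bigl(2\operatorname{tr}(AB) - \operatorname{tr}A\operatorname{tr}B\bigr)
      + t^2\bigl(\operatorname{tr}^2 B - 4\det B\bigr).
\end{align*}
```

The constant term and the coefficient of $t^2$ are the squared eigenvalue differences of $A$ and of $B$ respectively.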

There are three possibilities: either $\deg_t f \le 1$, or $f$ is the square of another
polynomial, or $f$ vanishes at two different values of $t$ (say, $t_1$ and $t_2$). (Note that
here we use that the field is quadratically closed.) This polynomial $f$ vanishes if
and only if the two eigenvalues of $A + tB$ are equal, and this happens in two cases
(according to Lemma O): $A + tB$ is scalar or $A + tB$ is nilpotent. Thus either
both $A + t_i B$ are scalar, or $A + t_1 B$ is scalar and $A + t_2 B$ is nilpotent, or both
$A + t_i B$ are nilpotent. The first case implies that $A$ and $B$ are scalars, which is
impossible. The second case implies that the matrix $A + \frac{t_1 + t_2}{2} B \in \tilde{K}$, which is
also impossible. The third case implies that $\operatorname{tr} A = \operatorname{tr} B = 0$, which we claim is also
impossible. If $\deg_t f \le 1$, then for large $t$ the difference $\lambda_1 - \lambda_2$ of the eigenvalues
of $A + tB$ is much less than $t$, so the difference between the eigenvalues of $B$ must be 0,
a contradiction.

It follows that $f(t) = (\lambda_1 - \lambda_2)^2$ is the square of a polynomial (with respect
to $t$). Thus $\lambda_1 - \lambda_2 = a + tb$, where $a$ and $b$ are some functions of the entries of
the matrices $x_1, \dots, x_m, y_i$. Note that $a$ is the difference of eigenvalues of $A$ and $b$
is the difference of eigenvalues of $B$. Thus $a$ does not depend on $y_i$ and $b$ does not
depend on $x_i$.

Note that $(\lambda_1 - \lambda_2)^2 = a^2 + 2abt + b^2 t^2$, which means that $a^2$, $b^2$ and $ab$ are
polynomials (note that here we use Char $K \neq 2$). Thus, $\frac{a}{b} = \frac{a^2}{ab}$ is a rational
function. Therefore there are polynomials $p_1$, $p_2$ and $q$ such that $a = p_1\sqrt{q}$ and
$b = p_2\sqrt{q}$. Without loss of generality, $q$ does not have square divisors. By the
observation above, $q$ does not depend on $x_i$ and $y_i$. Now consider the change of other
variables. The function $a$ is the difference of eigenvalues of $A = p(x_1, \dots, x_m)$, so it
remains unchanged. Thus $q$ does not depend on the other variables either. That is why
$\lambda_1 \pm \lambda_2$ are two polynomials and hence the $\lambda_i$ are polynomials. One concludes with the
last paragraph of the proof.

Proof. According to Lemma 9 it suffices to prove that the image of $p$ cannot be
$M_2(K) \setminus \tilde{K}$. Assume that the image of $p$ is $M_2(K) \setminus \tilde{K}$. Consider for each variable $x_i$
the line $x_i + t y_i$, $t \in K$. Then $p(x_1, \dots, x_{i-1}, x_i + t y_i, x_{i+1}, \dots, x_m)$ is the line
$A + tB$, where $p(x_1, \dots, x_m) = A$ and $p(x_1, \dots, x_{i-1}, y_i, x_{i+1}, \dots, x_m) = B$. Thus
$A + tB \notin \tilde{K}$ for any $t$. Since $B$ is diagonalizable, we can choose our matrix units
$e_{i,j}$ such that $B$ is diagonal. Therefore

$$B = \lambda_B I + \begin{pmatrix} c & 0 \\ 0 & -c \end{pmatrix}, \qquad A = \lambda_A I + \begin{pmatrix} x & y \\ z & -x \end{pmatrix}.$$

Hence

$$A + tB = (\lambda_A + t\lambda_B) I + \begin{pmatrix} x + tc & y \\ z & -x - tc \end{pmatrix}.$$

The matrix $\begin{pmatrix} x + tc & y \\ z & -x - tc \end{pmatrix}$ is nilpotent if and only if $(x + tc)^2 + yz = 0$,
which has the solutions $t_{1,2} = \frac{1}{c}(-x \pm \sqrt{-yz})$. Thus, when $yz \neq 0$, $\pi(A + t_j B)$ will
be nilpotent for $j = 1, 2$, where $\pi(X) = X - \frac{1}{2}\operatorname{tr} X$. However $\operatorname{tr}(A + t_j B)$ is nonzero
for one of these values of $t_j$, implying $A + t_j B \in \tilde{K}$, a contradiction.
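
To make the root computation explicit, here is a short worked version. It assumes, as in the proof, that the trace-zero part of $A + tB$ has diagonal entries $\pm(x + tc)$ and off-diagonal entries $y$, $z$, with $c \neq 0$ and Char $K \neq 2$; the symbol $N(t)$ is introduced here only for illustration.

```latex
\begin{align*}
N(t) &= \begin{pmatrix} x+tc & y \\ z & -(x+tc) \end{pmatrix},
\qquad N(t)^2 = \bigl((x+tc)^2 + yz\bigr) I, \\
N(t)^2 = 0 &\iff (x+tc)^2 = -yz
 \iff t = t_{1,2} = \tfrac{1}{c}\bigl(-x \pm \sqrt{-yz}\bigr), \\
t_1 - t_2 &= \tfrac{2}{c}\sqrt{-yz} \neq 0 \quad \text{when } yz \neq 0 .
\end{align*}
```

For a trace-zero $2 \times 2$ matrix, nilpotency is equivalent to $N(t)^2 = 0$, which is why a single scalar condition suffices.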

Thus, we must have $yz = 0$. Without loss of generality we can assume that
$z = 0$. Any matrix $M$ of the type $qI + \begin{pmatrix} w & * \\ 0 & -w \end{pmatrix}$ satisfies $\det M = q^2 - w^2$ and
$q = \frac{1}{2}\operatorname{tr} M$. Thus $x = \sqrt{\frac{1}{4}(\operatorname{tr} A)^2 - \det A}$ and $c = \sqrt{\frac{1}{4}(\operatorname{tr} B)^2 - \det B}$. Consider the
matrix

$$P_i = cA - xB = p(x_1, \dots, x_{i-1}, c x_i - x y_i, x_{i+1}, \dots, x_m),$$

which must be scalar or nilpotent. It can be written explicitly algebraically in terms
of the entries of $x_i$ and $y_i$. Also, $P_i = (c\lambda_A - x\lambda_B) I + (cy) e_{12}$, where $e_{12}$ is the
matrix unit. There are two cases. If $y = 0$ then the line $A + tB$ includes a scalar
matrix, and if $y \neq 0$ then $c\lambda_A - x\lambda_B = 0$ and all matrices on the line $A + tB$ have
the same ratio of eigenvalues.
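
The formula for $P_i$ and the constant-ratio claim can be checked directly. The sketch below assumes, as in the proof, that $z = 0$, that $B = \lambda_B I + c(e_{11} - e_{22})$ and that $A = \lambda_A I + x(e_{11} - e_{22}) + y e_{12}$. With these conventions the scalar part of $cA - xB$ comes out as $c\lambda_A - x\lambda_B$. The names $\mu_1, \mu_2$ for the eigenvalues of $A + tB$ are introduced here only for illustration.

```latex
\begin{align*}
cA - xB &= (c\lambda_A - x\lambda_B) I
   + c\begin{pmatrix} x & y \\ 0 & -x \end{pmatrix}
   - x\begin{pmatrix} c & 0 \\ 0 & -c \end{pmatrix}
   = (c\lambda_A - x\lambda_B) I + cy\,e_{12}, \\
\mu_{1,2} &= (\lambda_A + t\lambda_B) \pm (x+tc), \\
c\lambda_A = x\lambda_B &\implies \lambda_A + t\lambda_B = \tfrac{\lambda_B}{c}\,(x+tc)
 \implies \mu_{1,2} = \tfrac{1}{c}\,(x+tc)\,(\lambda_B \pm c).
\end{align*}
```

In the second case the ratio $\mu_1 : \mu_2 = (\lambda_B + c) : (\lambda_B - c)$ is independent of $t$ wherever $x + tc \neq 0$.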

Let $S_1 = \{i : P_i \in K\}$ and $S_2 = \{i : P_i \in \mathrm{sl}_2(K)\}$. Without loss of generality we
can assume for some $k \le m$ that $S_1 = \{1, 2, \dots, k\}$ and $S_2 = \{k + 1, \dots, m\}$. The four
entries of $p(x_1, \dots, x_m)$ are

$$p_{ij}(x_{1,(1,1)}, x_{1,(1,2)}, x_{1,(2,1)}, x_{1,(2,2)}, \dots, x_{m,(2,2)}),$$

polynomials in the entries of the $x_i$. Consider the scalar function

$$f_1(x_1, \dots, x_m) = \frac{\frac{1}{2}\operatorname{tr} p(x_1, \dots, x_m)}{R(x_1, \dots, x_m)},$$

where $R(x_1, \dots, x_m) = \sqrt{\frac{1}{4}\operatorname{tr}^2 p(x_1, \dots, x_m) - \det p(x_1, \dots, x_m)}$. This function is
defined everywhere except for those $(x_1, \dots, x_m)$ for which $p(x_1, \dots, x_m)$ is a matrix
with equal eigenvalues, because $R$ is the half-difference of eigenvalues. The function
$f_1(x_1, \dots, x_m)$ does not depend on $x_{k+1}, \dots, x_m$ because for any $i \ge k + 1$,
substituting $y_i$ instead of $x_i$ does not change the ratio of eigenvalues of $p(x_1, \dots, x_m)$.
Consider the matrix function

$$f_2(x_1, \dots, x_m) = \frac{p(x_1, \dots, x_m) - \frac{1}{2}\operatorname{tr} p(x_1, \dots, x_m)}{R(x_1, \dots, x_m)}.$$

This function is also defined everywhere except for those $(x_1, \dots, x_m)$ such that
the eigenvalues of $p(x_1, \dots, x_m)$ are equal. The function $f_2(x_1, \dots, x_m)$ does not
depend on $x_i$, $i \le k$, because for any $i \le k$ substituting $y_i$ instead of $x_i$ does not
change the basis in which $p(x_1, \dots, x_m)$ is diagonal. $R^2$ is a polynomial:

$$R^2 = \frac{1}{4}\operatorname{tr}^2 p(x_1, \dots, x_m) - \det p(x_1, \dots, x_m).$$

Write $R^2 = r_1 r_2 r_3$, where $r_1$ is the product of all the irreducible factors in which
only $x_1, \dots, x_k$ occur, $r_2$ is the product of all the irreducible factors in which only
$x_{k+1}, \dots, x_m$ occur, and $r_3$ is the product of the other irreducible factors. We have that

$$\frac{\frac{1}{4}\operatorname{tr}^2 p(x_1, \dots, x_m)}{r_1(x_1, \dots, x_m)\, r_2(x_1, \dots, x_m)\, r_3(x_1, \dots, x_m)} = f_1^2(x_1, \dots, x_m)$$

does not depend on $x_{k+1}, \dots, x_m$. Therefore if $\operatorname{tr}^2 p = q_1 q_2 q_3$ (again in $q_1$ only
$x_1, \dots, x_k$ occur, in $q_2$ only $x_{k+1}, \dots, x_m$ occur and $q_3$ is all the rest) then
$\frac{r_1 r_2 r_3}{q_1 q_2 q_3}$
does not depend on $x_{k+1}, \dots, x_m$. Hence $r_2 = q_2$ and $r_3 = q_3$ (up to scalar factors).

Since $q_1 q_2 q_3$ is the square of a polynomial, all the $q_i$ are squares; therefore $r_2$ and
$r_3$ are squares. Now consider the function $\frac{p_{12}^2}{R^2}$. This is the square of the $(1,2)$-entry
in the matrix function $f_2$, so it does not depend on $x_1, \dots, x_k$. Writing
$p_{12}^2 = q_1 q_2 q_3$ (where, again, only $x_1, \dots, x_k$ occur in $q_1$, only $x_{k+1}, \dots, x_m$ occur in $q_2$,
and $q_3$ is comprised of all the rest), all the $q_i$ are squares and $q_1 = r_1$, implying that
$r_1$ is a square. Thus the polynomial $r_1 r_2 r_3 = R^2$ is the square of a polynomial.
Therefore $R$ is a polynomial. We conclude that $\lambda_1 - \lambda_2 = 2R$ is a polynomial (where
we recall that $\lambda_1$ and $\lambda_2$ are the eigenvalues of $p(x_1, \dots, x_m)$). Also $\lambda_1 + \lambda_2 = \operatorname{tr}(p)$ is
a polynomial, and hence the $\lambda_i$ are polynomials, which obviously are invariant under
conjugation (since any conjugation is the square of some other conjugation). Hence,
the $\lambda_i$ are polynomials of traces, by Donkin's Theorem quoted above. Now consider
the polynomials $(p - \lambda_1 I)$ and $(p - \lambda_2 I)$, which are elements of the algebra of free
matrices with traces, which we noted above is a domain. Neither is zero but
their product is zero, a contradiction. $\square$
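
The final contradiction rests on the Cayley-Hamilton identity for $2 \times 2$ matrices, which the text uses implicitly. A sketch of the computation, writing $p$ for $p(x_1, \dots, x_m)$ and $\lambda_1, \lambda_2$ for its eigenvalues:

```latex
\begin{align*}
(p - \lambda_1 I)(p - \lambda_2 I)
 &= p^2 - (\lambda_1+\lambda_2)\,p + \lambda_1\lambda_2\, I \\
 &= p^2 - \operatorname{tr}(p)\,p + \det(p)\, I = 0 .
\end{align*}
```

Neither factor vanishes, since $p$ evaluated on generic matrices is not scalar, so two nonzero elements of a domain would multiply to zero.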

Finally, Theorem 5 follows from Lemmas 10 and 11.